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Abstract. We derive a new result for exponential approximation using 
Stein's method of exchangeable pairs. As an application, an exponential 
limit theorem with error term is derived for \Tr(U)\ 2 , where Tr(U) de- 
notes the trace of a matrix chosen from the Haar measure of the unitary 
group U(n, C). 



1. Introduction 

The purpose of this paper is to further develop exponential approximation, 
using Stein's method of exchangeable pairs. The first use of exchangeable 
pairs in exponential approximation was in the paper which studied the 
spectrum of the Bernoulli-Laplace Markov chain. Unfortunately the results 
in [J], which use the Kolmogorov metric, are very complicated and seem 
quite hard to apply in other examples. We provide approximation results 
similar to those in [l|, but which are significantly easier to compute. In 
particular, we do not see how to apply the results of [l| to the example in 
this paper. 

We work in a "smooth" test function metric, but also provide bounds in 
the Kolmogorov metric, which is defined for random variables W and Z to 
be 

d K (W, Z) = sup \¥{W <t) - P(Z <t)\. 

tER. 

Our main theoretical result is the following theorem (see also Theorem 12.11 
below) . 

Theorem 1.1. Let Z be a mean one exponential random variable. IfW>0 
is a random variable with finite second moment and (W, W) is an exchange- 
able pair such that for some a > and sigma-field T D o~(W), 

(1) E[W' - W\F] = -a(W - 1) + R, 

then for all S > 0, 

E[(W- W) 2 \F] 



d K (W,Z) < -E 



W 



2a 



+ -\EW- II 
o 



'(5-6/e) 3\ E\W'-W\ 3 8E\R\ 
+ + W a + -5— +5/2 - 
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Remark: The theorem is stated with a choice of 5 in order to simplify the 
error bound, but it is obvious that in applications 5 should be chosen to 
minimize the bound. 

One of the attractive points about this result is that the terms are very 
similar to those which one encounters in normal approximation. To see the 



parallels, here is a normal approximation theorem of Rinott and Rotar 24] 
(in the Kolmogorov metric). 

Theorem 1.2. Let (W, W) be an exchangeable pair of real random variables 
such that E(W) = 0,E(Ty 2 ) = 1 and E{W'\W) = (1 - a)W + R{W) with 
< a < 1. Then for all real xq, 

1 f X ° x 2 

P(W < xq) - —= / e~ — dx 



< -JVar(E[(W - W) 2 \W]) + 19 ^-B5 + Q x -E\W - W\ 3 . 
a a V a 

For normal approximation, there are algebraically natural examples re- 
lated to Markov chain spectra and random matrices (see [8], [9(]), which are 
perfectly suited for the computation of terms such as Far(E[(iy' — P^) 2 |iy]) 
and \W - W\ 3 . This is why we believe the bound in Theorem 11.11 will be 
useful for exponential approximation. 

Indeed, in ISection "31 we consider the random variable W = \Tr(U)\ 2 , 
where Tr denotes trace and U is from the Haar measure of the unitary 
group U(n, C). Since Tr(U) converges to a complex normal [a], it follows 
that \Tr(U)\ 2 converges to an exponential with mean 1. In studying the 
correspondence between unitary eigenvalues and zeros of the Riemann zeta 
function, it is conjectured in [J] that the convergence of \Tr(U)\ 2 to an 
exponential limit is extremely rapid, more precisely that there are positive 
c, 5 such that for all n > 1, t > 0, 

\F(\Tr{U)\ 2 > i)-e _t | < m^ n '. 

The authors suggest that this should follow from methods in Johansson's 



remarkable paper [14j ]. This seems challenging to make rigorous, particularly 
if one wants to make c, 5 explicit. In ISection 3\ we give the first rigorous and 
explicit error term for this problem, proving that the Kolmogorov distance 
between \Tr(U)\ 2 and a standard mean 1 exponential is at most 2 9//4 /- v /n. 
Another approach to this result might be to use the multivariate central 
limit theorems in [7j; we do not pursue this here. 

To close the introduction, we mention some related results using Stein's 
method for exponential approximation. Aside from [H which we already 
mentioned, a recent paper of Chatterjee and Shao [3] (and similar results in 
Section 13.4 of the text [1]), use Stein's method of exchangeable pairs for 
exponential approximation. However our approach is quite different than 
theirs since they assume the exchangeable pair (W, W') satisfies 

E(W' - W\W) = 1/co + R(W), 

with Co a positive constant, rather than the linearity condition ([1]) assumed 
here. Another approach to Stein's method for exponential approximation is 
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the "equilibrium distribution" coupling for which we refer the reader to the 
papers by Pekoz and Rollin [H], [3], and to the references therein. For the 
generator method (in the more general context of chi-squared approxima- 
tion), one can consult [l7| or [z| ■ We note that the examples in [ItJ and 
[22| are about independent random variables, whereas the example in the 
current paper involves dependence. Finally, the introductory survey [2^] has 
some discussion of these approaches in the wider context of Stein's method. 



2. General theorem 

The purpose of this section is to prove Theorem 11.11 from the introduc- 
tion. We first prove an intermediate result which can be thought of as an 
approximation result for "smooth" test functions. 

Theorem 2.1. Let Z be a mean one exponential random variable. IfW > 
is a random variable with finite second moment and (W, W) is an exchange- 
able pair such that for some a > and sigma-field T 5 o~(W), 

(2) E[W' - W\F] = -a(W - 1) + R, 

then for all twice differentiable functions h with \\h'\\, \\h"\\ < 00, 

E[(W' -W) 2 \F] 



(3) \Eh{W)-Eh(Z)\ < 4||/t'||E 



W 



2a 



+ ||/i'|||EW-l| 



,4) +{2( ,. 6/em+nhl) m^Ei +m m, 

The proof of Theorem 1 2 . 1 1 roughly follows the usual development of Stein's 
method of exchangeable pairs for distributional approximation. Specifically, 
for W the random variable of interest and Z having the exponential distri- 
bution, we want to bound |E/i(Vy) — Eh(Z)\ for functions h in some prede- 
termined family of test functions (here, twice differentiable functions h with 
\\h'\\, \\h"\\ < 00). Typically, this program has three components. 

1. Define a characterizing operator A for the exponential distribution 
which has the property that 

EAf(Z) = 

for all / in a large enough class of functions if and only if Z ~ Exp{\). 

2. For functions h in the class of interest, define fh to solve 

(5) Af h (x) = h(x)-Eh(Z). 

3. Using note that 

\Eh(W) - Eh(Z)\ = \EAf h {W)\. 

Now use properties of the solutions fh and the auxiliary randomiza- 
tion of exchangeable pairs to bound this last term. 

The next lemma takes care of Items 1 and 2 and also provides the bounds 
on the solutions fh needed for Item 3 in the program above. The proof of 
Theorem 12.11 is immediately after the proof of the lemma. 
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Lemma 2.2. Let Z be a mean one exponential random variable. If h is a 
function such that the following integrals are well defined, then 

„iu poo 



(6) 



f(w) = f h (w) = / (h(x) - Eh(Z))e- x dx 



solves the differential equation 

(7) wf'(w) -{w- 1)/H = h{w) - Eh(Z). 

If h is absolutely continuous with \\h'\\ < oo, then 

2 X 



(8) 



< 1 + 



\h'\ 



\\f'\\<2\\h'\\. 



If in addition, h'(0) = and h! is absolutely continuous with \\h"\\ < oo ; 
then 

\\f"\\<(5-6/e)\\ti\\+3\\ti'\\. 

Proof. The fact that © solves ([7]) is straightforward to verify. Now, using 
that Kh(Z) = Jq 00 h(x)e~ x dx, we can rewrite ([6]) as 

f( w ) = / h{x)e~ x dx H / e~ x h(x)dx 

w Jw w J 

e w (l - e~ w 



(9) 



w 



h(x)e x dx + 



IV 



h(x)e x dx. 



(x+1) 



To prove ([8]), first note that since translating h by a constant leaves / un- 
changed, we may (and do) assume without loss of generality that h(0) = 0, 
so that h{x) < \\h'\\\x\. Using this fact and also that J xe~ x dx 
in the equality below, we find 

' e w (l - r 

l/HI < II^T 

- \\h'\ 



w 



oo i 

xe~ x dx H 

w 



xe x dx 



(1 



'){w + 2) 



IV 



To bound this last expression, we compare derivatives to find 

(l-e- w )(w + 2) -we~ w < (1 + 2/e)to, w > 0, 

which yields the first assertion of ([5]). 

For the second assertion note that ([7]) implies that 

h(w) 



(10) 



w 



[l-w)f(w)+Eh(Z) \ 



w 



Since |/i(x)| < ||/i'|||rc|, the first term of (I10p is bounded in absolute value by 
\\h'\\, so we only need to find an appropriate bound on the term of (|1U|) that 
is in parentheses. We have by ([9j) that 



[l-w)f(w)+Eh(Z) 



(11) 



uu 



1 



1 — w 



IV 



IV Z 



h(x)e x dx 



w- 



H j / h{x)e x dx. 
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One easily checks that 

e w — 1 e w - 1 - w 



w w 2 



> 0. 



Now taking the absolute value of ([TT]) . using the triangle inequality, bounding 
\h(x)\ < \\h'\\\x\, and using J xe~ x dx = —e~ x (x + 1) , we find 

{l-w)f(w)+Eh(Z)\ 



w 



< \\ h '\\ f (w + l)(w + e- w -l) + l-{w + l)e- w \ = , 
~ \ w 2 w 2 J 

which now yields the second assertion of flSJ). 

To prove the final statement of the lemma, take the derivative of (1101) 
using the expression ([TTj) to find 

(12) fH = hM + (™-yhH 

w w z 

(13) H ^ — / h(x)e x dx + ^r / h(x)e x dx. 

w d J w J Q 

To bound these expressions we first note that since h'(0) = h(0) = 0, 

\h(x)\ < min{||/i , |||x|,||/i"||x 2 /2}, and \h'(x)\ < \\h"\\\x\ 
and in particular, |/i(x)| is bounded above by both terms appearing in the 



minimum. Thus, the absolute value of the right hand side of <\12h is bounded 
above by 

\\h"\\+mm{\w/2 - l|||^"||,|l-2/u;|||h / ||} < \\h"\\ + max{||^||, \\h"\\}, 

< 2\\h"\\ + \\h'\\ 

where we have used that min{|w/2 — 1|, |1 — 2/w\} < 1. 

We bound the second term (|13|) differently according to w > 1 or w < 1. 
Suppose that w > 1. Then note that (w 2 — 2w + 2)e w > e w > 2. Using that 
\h(x)\ < \\h'\\\x\ and J xe~ x dx = —e~ x (x + 1), we find the absolute value of 
(JT3J) is bounded above by 



\ti\ 



((w 2 -2w + 2)e w - 2)(w + l)e~ w + 2(1 - (w + l)e 



w 3 



, w 3 - w 2 + 4 - 4(w + l)e~ w ) 
w 3 

(M) < 11,^+3(1-^ + 1)^ 

w 6 

where we have used that 1 — w 2 < 0. By comparing derivatives we find 

l-(w + l)e~ w < (l-2/e)w 3 , w > 1, 

so that (fT4"J) (and hence ([T3]) ) is bounded above by (4 — 6/e)||/t'|| for w > 1. 
If < w < 1, then 

(w 2 -2w + 2)e w > {w 2 - 2w + 2) (l + w + ^) = > 2 - 
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Using that \h(x)\ < \\h"\\x 2 /2 and / x 2 e x dx = — e x (2 + 2x + x 2 ), we find 
the absolute value of f)13[> is bounded above by 



\\h" 
(15) 



{(w 2 -2w + 2)e w -2)(2 + 2w + w 2 )e- w 2-(2 + 2w + w 2 )e 



\h" 



2w s 

4e~ w '{2 + 2w + w 2 ) 



+ 



2w 3 



Again by comparing derivatives we find 

w A + 8 - 4e~ w (2 + 2w + w 2 ) < 2w 3 , 0<w<l, 

so that (fT5|) (and hence (fT3|) ) is bounded above by \\h"\\ for < w < 1. □ 

Proof of Theorem \2.1\ We show that for h as in theorem, |E/i(lU) — Eh{Z)\ is 
appropriately bounded. We would like to follow the program outlined at the 
beginning of the section, but in order to apply the bounds of Lemma [2. 2\ we 
must have /i'(0) = 0, which is not assumed in Theorem 12.11 We circumvent 
this problem by replacing h with h(x) = h(x) — xh'(0), and we have 

\Eh(W) - Eh(Z)\ < \Kh(W) - Eh(Z)\ + |h'(0)||EW - EZ\ 

(16) < \Eh(W) - Eh(Z)\ + ||/t'|||EW - 1|. 



In order to bound |E/i(VF) — Eh(Z)\, we use Lemma 12.21 in conjunction with 
Item 3 of the program outlined at the beginning of this section to show that 
the absolute value of 

(17) E[Wf'(W)-(W -l)f(W)} 

is appropriately bounded, where / satisfies ([7]) with h replaced by h. 
Using exchangeability and the linearity condition ([2]), we observe that 

E [(W' - W)(f(W) - f(W'))] = 2E[f(W)(W - W)} 

= -2aE[(W - l)f(W)] + 2E[Rf(W)], 

so that ([TT]) is equal to 

E[Wf{W)\ - (2a)- 1 E[(^ / - W)(f(W) - f{W))\ - a - l E[Rf{W)}. 
We rewrite this expression as 

E[(W -W) 2 \F\ 



E 



f'(W) W 



- E 



(W - W) 



2a 



2a 

w'~w 



[f'(W + t)-f'{W)]dt 



o 



a^ElRfiW)}. 



Now taking the absolute value of this last expression, we find that (H 
absolute value is bounded above by 

E[(W -W) 2 \F] 



m 



ll/'P 



W 



+ 



IE 



2a 

\W'-W\ 
2a 



w'-w 



\t\dt 



+ 



E\R\ 
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The result now easily follows after applying the bounds of Lemma 12.21 with 
h replaced by h, noting that \\h"\\ = \\h"\\ and \\h'\\ = \\ti - h'(0)\\ < 2\\h'\\, 
and recalling (|16jl . □ 

We are now in a position to prove Theorem II .li First define the function 
for t,x > 0, and 5 > 0, 

' 1, x <t-S 



(18) h t! s(x) 



1 _ 2 (x-t+S)^ t _ S <x<t-6/2 

2 -^, t-5/2<x<t 
0, x > t 



The next lemma states some important facts regarding the use of ht § in 
our framework. 



Lemma 2.3. Ift>0, 5 > 0, and h t $ is defined by (|18p . then 
||MI = 1 > \\K4 = 2/5, \\h'l 5 \\=A/5 2 . 

If W > is a random variable and Z has the exponential distribution with 
mean one, then 

(19) d K (W, Z) < sup \Eht tS (W) - Eht, s {Z)\ + 5/2. 

Proof. The first assertion follows from direct computation. For the second, 
note that 

V(W < t) - P(Z < t) < Eht +s ,s(W) - P(Z < t) 

= Eh t+ s,8(W) - Eh t +sA z ) + ®h t+s ,s(Z) - P(Z < t) 

rt+5 

< \Eh t+s , s (W) - Eh t+ s iS (Z)\ + J h t+ s4x)e- x dx. 
Since e~ x < 1 for x > 0, we find by direct computation that 

t+8 rt+5 

h t+ 8,s(.x)e~ x dx < j h t+ s,s(x)dx = 5/2. 
Taking supremums, we have shown 

(20) sup[E(W < t) - P(Z < t)} < sup \Eh t ,s(W) - Eh t ,s(Z)\ + 5/2. 

t>0 t>0 
A similar argument starting from 

P(Z < t) - F(W <t)< P(Z < t) - E\ S {W) 
establishes ([20]) with the left hand side replaced by 

sup[P(Z < t)-¥(W < t)], 

t>o 

which proves the lemma. □ 

Proof of Theorem [7771 First apply Theorem 12.11 with h replaced by hts to 
obtain a bound on sup t>0 \Eh t) $(W) — Eht t s(Z)\. From this point, the result 
follows from the bounds of Lemma 12.31 and (1191). □ 



s 
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3. Exponential approximation of \Tr(U)\ 2 

The main purpose of this section is to prove the following result. 

Theorem 3.1. Let W = \Tr(U)\ 2 , where U is from the Haar measure of 
U(n,C). Then for n > 8, the Kolmogorov distance between W and an expo- 
nential with mean one is at most 

To construct an exchangeable pair to be used in our application, we use 
the heat kernel of U(n,C). This has proved useful in other Stein's method 
problems about random matrices 0, 0, 0- See 0, @ for a detailed 



discussion of heat kernels on compact Lie groups. The papers [Ha], [la], 2C] 

3 



illustrate combinatorial uses of heat kernels on compact Lie groups, and 
also discusses the use of the heat kernel for finite groups. 

The heat kernel on a compact Lie group G is defined by setting for x,y £ G 
and t > 0, 

(21) K(t, x,y) = Y t e-^M^Mlj), 

n>0 

where the X n are the eigenvalues of the Laplacian repeated according to 
multiplicity, and the <j> n are an orthonormal basis of eigenfunctions of L 2 (G); 
these can be taken to be the irreducible characters of G. 

We use the following properties of the heat kernel, where A denotes the 



Laplacian of G. Part 1 of Lemma 13.21 is from page 198 of 12|]. Part 2 of 
Lemma 13.21 is immediate from the expansion (|21|) . Part 3 of Lemma 13.21 is 
Lemma 2.5 of 

Lemma 3.2. Let G be a compact Lie group, x,y S G, and t>0. 

(1) K(t,x,y) converges and is non-negative for all x,y,t. 

(2) fy^Q K(t,x,y)dy = 1, where the integration is with respect to Haar 
measure of G. 

(3) For smooth (J), as t — > 0, one has that 

K(t, x, y)<^{y)dy = <f>(x) + t(A<f>)(x) + 0{t 2 ). 

The symmetry in x and y of K (t, x, y) shows that the heat kernel is a 
reversible Markov process with respect to the Haar measure of G. It is a 
standard fact jH] , |27j that reversible Markov processes lead to exchangeable 
pairs Namely suppose one has a Markov chain with transition 

probabilities K(x, y) on a state space X, and that the Markov chain is 
reversible with respect to a probability distribution ir on X. Then given 
a function / on X, if one lets W = f{x) where x is chosen from ir and 
W = f{x') where x' is obtained by moving from x according to K(x,y), 
then (W, W) is an exchangeable pair. In the special case of the heat kernel 
on a compact Lie group G, given a function / on G, one can construct an 
exchangeable pair (W, W) by letting W = f(U) where U is chosen from 
Haar measure, and W = f(U'), where U' is obtained by moving time t from 
U via the heat kernel. To define the exchangeable pair (W, W) used in this 
paper, we further specialize by setting f(U) = \Tr(U)\ 2 . 
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If A is an integer partition, and rrij denotes the multiplicity of part j in A, 
we define p x (U) = ^Tr^)^ . For example, p 5 ,3, 3 (f7) = Tr(U 5 )Tr(U 3 ) 2 . 
Typically we suppress the U and use the notation p\. 

The next three lemmas are from Rains [20|; here V/ • Vg is defined by 

Vf-Vg = ^[A(fg)-gAf-fAg}. 
Lemma 3.3. Aur n )Pl = ~ n Pi- 

Lemma 3.4. For all integers k and I (not necessarily positive), and unitary 
U, 

(V Pk (U))-(V Pl (U)) = -kl- Pk+l (U). 
Lemma 3.5. For all unitary matrices U and class functions fx, - ■ ■ , fk 

a ( n mu) 



Ki<k 



l<i<fc / \l<«<fc v ; l<i<j<k 



A/i(tQ v (V/i(^))-(V/j(^) ) 



The final lemma is a moment computation from 0]. 

Lemma 3.6. Let U be Haar distributed on U(n,C). Let (ai,-- - ,a&) and 
(&!,••• , &&) 6e vectors of non-negative integers. Then one has that for all 

n > Ei=l( a » + 6 i)> 



E 



JJrr(^>Tr(^') fc 



^;fl/'""r- 



Throughout we let W(U) = \Tr(U)\ 2 = Pl (U) Pl {U). 

Lemma [3T71 computes the conditional expectation E[W — W|C/]. 

Lemma 3.7. 

E[W' - W|L/] = 2nt(l - W) + 0{t 2 ). 
Proof. Applying part 3 of Lemma 13.21 
E[W'\U] = W + tA( Pl pT) + 0(t 2 ) 

= W + t[pjA( Pl ) + Pl A(pJ) + 2(Vpi) • (VpT)] + 0(t 2 ) 
= W + t[-2n PlP J + 2n] + 0{t 2 ). 

The second equality was Lemma 13.51 an d t ne final equality used Lemmas 
31and[331 □ 



Lemma ESI computes E[(W - W) 2 \U]. 
Lemma 3.8. 

E[(W' - W) 2 \U] = t[-2p 2P TJ - 2p5 Pl>1 + AnW] + 0(t 2 ). 



10 
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Proof. Clearly 

E[(W' - W) 2 \U] = E[(W') 2 \U] - 2WE[W'\U] + W 2 . 
By part 3 of Lemma 13.21 

E[(W') 2 \U] = W 2 + tA[p 1AP T^} + 0{t 2 ). 
Using Lemma 13,51 and then Lemmas 13,31 and 13,41 one computes that 
A[pi,ipTj] 



2Api 2ApT 2Vpi • Vpi 2VpI • Vpl 8Vpi • VpT 



Pi Pi Pi,i Pi,i PiPi 

= -4npi,ipiJ - 2p 2 p7J - 2p2Pi,i + 8npipT. 

Thus 

E[(W') 2 |C(1 = W 2 + t [-4npi,ipiJ - 2p 2 pTT - 2^pi,i + 8npipl] + 0(i 2 )- 
By Lemma 13.71 

— 2WTE[W'|IT| = -2W 2 + t[-AnW + 4ratf/ 2 ] + 0(t 2 ). 

Thus 

E[(W') 2 |#] - 2WTE[W'|f7] + W 2 = t[-2p 2 pTJ - 2paPl,i + 4nW] + 0(t 2 ). 

□ 

Next we compute expected values of low order moments of W — W. 

Lemma 3.9. Suppose that n > 8. Then 

(1) E(W - W) 2 = Ant + 0(i 2 ). 

(2) E(W — W) 4 = 0(t 2 ). 

(3) E\W -W\ 3 = 0{t 3 / 2 ). 

Proof. Lemma 13.81 implies that 

E{W' - W) 2 = tE [-2p 2 pTJ - 2pzpi,i + 4nVF] + 0(t 2 ). 

By Lemma [3~6l E[p2pTj] = 0, Efp^p^i] = 0, and E[W] = 1; the first part of 
the lemma follows. 

For part 2, first note that since 

E[(W'-W) 4 ] = E(W 4 )-AE{W 3 W')+m<[W 2 {W') 2 ]-AE[W{W') 3 ]+E[{W')% 

exchangeability of (W, W) gives that 

E(W' -Wf = 2E(W A ) - 8E(W 3 W') + 6E[W 2 {W') 2 ] 

= 2E(W 4 ) - 8E[W 3 E[W'\U}] + 6E[W 2 E[(W") 2 |^]]- 

By Lemma 13.71 
-8E[VF 3 E[W"|tf]] = -8E[W 3 (W + t(2n - 2nW) + 0(t 2 ))] 

= -8E(W 4 ) + tE[-l6nW 3 + WnW*} + 0(t 2 ) 
= -8E(W A ) + in[-16(6) + 16(24)] + 0(t 2 ) 
= -8E(VF 4 ) + 288tn + 0(t 2 ), 
where the penultimate equality used Lemma 13.61 
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By the proof of Lemma 13,81 an d then Lemma [ 

6E[W 2 E[(W') 2 \U}] 

= 6E [W 2 (W 2 + t[-AnW 2 - 2p 2 pJJ - 2pip M + 8nW] + 0(t 2 ))} 

= 6E[W 4 ] + iraE[-24W 4 + A8W 3 } + 0(t 2 ) 

= 6E[W 4 ] + tn[-24(24) + 48(6)] + 0{t 2 ) 

= 6E[W 4 ]- 288tn + 0{t 2 ). 

Summarizing, it follows that 

E(W - Wf = 2E{W A ) - 8E[W 3 E[W'\U]] + 6E[W 2 E[(W') 2 \U]] 

= 0(t 2 ), 

proving part 2 of the lemma. 

For part 3 of the lemma, one uses the Cauchy-Schwarz inequality to obtain 
that 

e\w' - w\ 3 < yjE{w - w) 2 e(w - wy. 

Part 3 then follows from parts 1 and 2 of the lemma. 

Now we proceed to the proof of Theorem 13.11 

Proof of Theorem \3.1\ By Lemma 13.71 one can apply Theorem 11.11 with a 
2nt. By Lemma 13.81 an d the triangle inequality, 

E[(W - W) 2 \W] 



□ 



E 



E 



W 
W 



2a 

t[-2p 2 pTJ - mpi,i + AnW] 0{t 2 ) 



Ant 



Ant 



— E|p 2 pi,i + P2P\,i\ + 0(t) 
2n 



< 



2n 



E(P2,2P1, 1,1,1 + 2p 2 ,l,lP2,l,l + P2,2Pl,l,l,l) + 0(t). 



By Lemma [Ml this is ±§ + 0(t); letting t -)• gives an upper bound 



E 



W 



E[(W -W) 2 \W] 



2a 



< 



V2 



n 



The second term in Theorem II .11 is since by Lemma 13.61 E(W) = 1. 
To bound E \ w '~ w \\ note by Lemma[lL9]that E|Ty'-Ty| 3 = 0(t 3 / 2 ). Since 

a = 2nt, the term — tends to as t — > 0. 

Finally, note from Lemma 13.71 that R = 0(t 2 ). Since a = Ant, it follows 
that 

E\R\ < ^E(R{WY) 



0(t) 

a a 

tends to as t — > 0. 

Summarizing, by letting t — > 0, Theorem 11.11 implies that for 5 > 0, 

ol K (W,Z)<^ + 5/2. 
on 

Choosing 5 = A ■ 2 1 / 4 /y / n, yields the claimed result. 



□ 
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Remarks: 

(1) The moments of the random variable \Tr(U)\ 2 have a combinatorial 



interpretation. Indeed, from 2l| one has for all positive integers I, n 
that 

|2n 



\L n < I) = 1 / |rr(C7)| 5 



'U(l,C) 

Here L n is the length of the longest subsequence of a random per- 
mutation on n symbols. 
(2) The technique used in this section can be used to prove that for 
positive integers k, \Tr(U k )\ 2 /k tends to an exponential with mean 
1, for U a Haar distributed unitary matrix from U(n, C), as n — > oo. 
The bookkeeping is quite tedious, so we do not carry this out. 
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